Commentary on Kaplan and Kay

نویسنده

  • Graeme D. Ritchie
چکیده

To appreciate this article fully, it is essential to understand the historical context into which it fits, and which it has to some extent created. Although formally published for the first time here, it is already an extremely influential and classic piece of work. Finite-state machines, in one form or another, have been used for the description of natural language since the early 1950s, with the extension to transducers appearing in the 1960s. After Chomsky's stern condemnation of the adequacy of finite-state machines for describing sentence structures, they virtually disappeared from mainstream theoretical linguistics. Within computer science, they continued to be a standard formalism, although transducers were not accorded the same detailed algebraic attention as simple automata. Phonologists, meanwhile, were inventing a variety of rule mechanisms that were (with rare exceptions) only partly formalized. Superficially, most of these systems (as typified by those of Chomsky and Halle) appeared to have little to do with finite-state machines. Indeed, their notations tended to suggest that the rules had much more than finite-state power. Kaplan and Kay have integrated these two streams of work--algebraic treatment of automata in computer science, and phonologically-motivated formalisms within linguistics--and their results should feed back productively into both subfields. The framework they have established allows the comparison of different competing formalisms in a rigorous manner, and permits the exploration of the formal limitations or capabilities of rule notations that were previously more like expository devices than formally defined systems. What may not be clear to the casual reader is that this work has been developed over many years, and early versions of it have already escaped into the computational linguistics community in less prominent forums. In this way it has already affected the course of research into phonological/morphological formalisms. Perhaps the most notable (and in its turn, influential) development has been Koskenniemi's two-level morphology, which has been successfully applied to the morphology of a very wide number of languages. Koskenniemi's ideas are a direct development of Kaplan and Kay's, as explained in Section 7 of the paper here. The theory of regular relations and finite-state transducers should not be viewed as a mere re-formalisation of 1960s linguistics. As well as its relevance to the two-level model, Kaplan and Kay suggest that it may also throw light on the formal properties of autosegmental phonology. Although the ideas were first circulated about 15 years ago, they are still of central relevance to computational phonology today.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integrating Finite-state Technology with Deep LFG Grammars1

Researchers at PARC were pioneers in developing finite-state methods for applications in computational linguistics, and one of the original motivations was to provide a coherent architecture for the integration of lower-level lexical processing with higher-level syntactic analysis (Kaplan and Kay, 1981; Karttunen et al., 1992; Kaplan and Kay, 1994). Finite-state methods for tokenizing and morph...

متن کامل

Commentary on Kaplan and Kay

Anyone with a fundamental interest in morphology and phonology, either from a scientific or a computational perspective, will want to study this long-awaited paper carefully. Kaplan and Kay (henceforth K&K) announce two goals: "to provide the core of a mathematical framework for phonology" and "to establish a solid basis for computation in the domain of phonological and orthographic systems." T...

متن کامل

Twenty-five years of finite-state morphology

Twenty-five years ago in the early 1980s, morphological analysis of natural language was a challenge to computational linguists. Simple cutand-paste programs could be and were written to analyze strings in particular languages, but there was no general language-independent method available. Furthermore, cut-and-paste programs for analysis were not reversible, they could not be used to generate ...

متن کامل

A General Computational Model for Word-Form Recognition and Production

A language independent model for recognition and production of word forms is presented. This "two-level model" is based on a new way of describing morphological alternations. All rules describing the morphophonological variations are parallel and relatively independent of each other. Individual rules are implemented as finite state automata, as in an earlier model due to Martin Kay and Ron Kapl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational Linguistics

دوره 20  شماره 

صفحات  -

تاریخ انتشار 1994